Improving the kelly-lochbaum vocal tract model using conical tube sections and fractional delay filtering techniques

نویسندگان

Vesa Välimäki

Matti Karjalainen

چکیده

An articulatory model of speech production is usually constructed by approximating the profile of the vocal tract using cylindrical tube sections. This is implemented by a digital ladder filter that is called the Kelly–Lochbaum model. In this paper we propose an extended approach, where the tube sections approximating the profile of the tract are conical instead of cylindrical. Furthermore, the length of each tube section in our model can be accurately controlled using a novel fractional delay filtering scheme. These refinements result in an accurate and intuitively controllable vocal tract model that is well suited for articulatory speech synthesis.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Articulatory speech synthesis based on fractional delay waveguide filters

An extension to the traditional Kelly-Lochbaum vocal tract model is introduced. In the new model not only the diameter but also the length of each tube section can be continuously adjusted. This is achieved by using fractional delay filter techniques such as interpolation and deinterpolation. The filter structure consisting of bidirectional delay lines (digital waveguides) and interpolated port...

متن کامل

Articulatory Vocal Tract Synthesis in Supercollider

The APEX system [1] enables vocal tract articulation using a reduced set of user controllable parameters by means of Principal Component Analysis of X-ray tract data. From these articulatory profiles it is then possible to calculate cross-sectional area function data that can be used as input to a number of articulatory based speech synthesis algorithms. In this paper the Kelly-Lochbaum 1-D dig...

متن کامل

Mixed physical modeling techniques applied to speech production

The Kelly-Lochbaum transmission-line model of the vocal tract started the discrete-time modeling of speech production. More recently similar techniques have been developed in computer music towards a more generalized methodology. In this paper we will study the application of mixed physical modeling to speech production and speech synthesis. These approaches are Digital Waveguides (DWG), Finite...

متن کامل

Estimation studies of vocal tract shape trajectory using a variable length and lossy kelly-lochbaum model

This work demonstrates the use of a modified KellyLochbaum (KL) vocal tract (VT) model in dynamic mapping from speech signals to articulatory configurations. The sixteen section KL model is equipped with a variable length segment for lip rounding and an accurate model for lip radiation impedance. Profiles for the eight Finnish vowels are used to form so called anchor points in the articulatory ...

متن کامل

Articulatory synthesis of formant targeted sounds with parameters derived from the inverse solution of speech production

A new approach to produce high fidelity speech sounds by applying both the inverse solution of speech production and the pitchsynchronousarticulatory synthesis technique is presented. Given a formant trace target, the dynamic vocal-tract area function together with time variant VT length are estimated using an inverse solution of speech production. The improved Kelly-Lochbaum filter of the synt...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1994

Improving the kelly-lochbaum vocal tract model using conical tube sections and fractional delay filtering techniques

نویسندگان

چکیده

منابع مشابه

Articulatory speech synthesis based on fractional delay waveguide filters

Articulatory Vocal Tract Synthesis in Supercollider

Mixed physical modeling techniques applied to speech production

Estimation studies of vocal tract shape trajectory using a variable length and lossy kelly-lochbaum model

Articulatory synthesis of formant targeted sounds with parameters derived from the inverse solution of speech production

عنوان ژورنال:

اشتراک گذاری